---
title: LlamaEdge
---

>[LlamaEdge](https://llamaedge.com/docs/intro/) is the easiest & fastest way to run customized
> and fine-tuned LLMs locally or on the edge.
>
>* Lightweight inference apps. `LlamaEdge` is in MBs instead of GBs
>* Native and GPU accelerated performance
>* Supports many GPU and hardware accelerators
>* Supports many optimized inference libraries
>* Wide selection of AI / LLM models



## Installation and Setup

See the [installation instructions](https://llamaedge.com/docs/user-guide/quick-start-command).

## Chat models

See a [usage example](/oss/integrations/chat/llama_edge).

```python
from langchain_community.chat_models.llama_edge import LlamaEdgeChatService
```
